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Related Application 

The subject matter of this application is related to the subject matter in a 
co-pending non-provisional application by the same inventors as the instant 
application entitled, "Applying Term Consistency to an Inequality Constrained 
Interval Global Optimization Problem/' havmg serial number TO BE 
ASSIGNED, and filing date 13 December 2001 (Attomey Docket No. SUN- 
P6446-SPL). 

BACKGROUND 

Field of the Invention 

The present invention relates to performing arithmetic operations on 
interval operands within a computer system. More specifically, the present 
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invention relates to a method and an apparatus for using a computer system to 
solve a global optimization problem including inequality constraints with interval 
arithmetic. 



5 Related Art 

Rapid advances in computing technology make it possible to perform 
trillions of computational operations each second. This tremendous 
computational speed makes it practical to perform computationally intensive tasks 
as diverse as predicting the weather and optimizing the design of an aircraft 

1 0 engine. Such computational tasks are typically performed usmg machine- 

representable floating-point numbers to approximate values of real numbers. (For 
example, see the Listitute of Electrical and Electronics Engineers (IEEE) standard 
754 for binary floating-point numbers.) 

In spite of their limitations, floating-point numbers are generally used to 

1 5 perform most computational tasks. 

One limitation is that machine-representable floating-point numbers have a 
fixed-size word length, which limits their accuracy. Note that a floating-point 
number is typically encoded using a 32, 64 or 128-bit binary number, which 
means that there are only 2^^, 2^"^ or 2^^^ possible symbols that can be used to 

20 specify a floating-point number. Hence, most real number values can only be 
approximated with a corresponding floating-point number. This creates 
estimation errors that can be magnified through even a few computations, thereby 
adversely affecting the accuracy of a computation. 

A related limitation is that floating-point numbers contain no information 

25 about their accuracy. Most measured data values include some amount of error 
that arises from the measurement process itself This error can often be quantified 
as an accuracy parameter, which can subsequently be used to determine the 
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accuracy of a computation. However, floating-point numbers are not designed to 
keep track of accuracy information, whether from input data measurement errors 
or machine roxinding errors. Hence, it is not possible to determine the accuracy of 
a computation by merely examining the floating-point number that results from 
the computation. 

Interval arithmetic has been developed to solve the above-described 
problems. Interval arithmetic represents numbers as intervals specified by a first 
(left) endpoint and a second (right) endpoint. For example, the interval [a, b], 
where a < 6, is a closed, bounded subset of the real numbers, i?, which includes a 
and b as well as all real numbers between a and b. Arithmetic operations on 
interval operands (interval arithmetic) are defined so that interval results always 
contain the entire set of possible values. The result is a mathematical system for 
rigorously bounding numerical errors from all sources, including measurement 
data errors, machine rounding errors and their interactions. (Note that the first 
endpoint normally contains the "infimum", which is the largest number that is less 
than or equal to each of a given set of real nvimbers. Similarly, the second 
endpoint normally contains the "supremxmi'', which is the smallest number that is 
greater than or equal to each of the given set of real numbers.) 

One commonly performed computational operation is to perform 
inequality constrained global optimization to find a global minimum of a 
nonlinear objective function/x), subject to nonUnear inequality constraints of the 
form pi(x) ^ 0 (/=1 . . ,m). This can be accomplished using any members of a set 
of criteria to delete boxes, or parts of boxes that either fail to satisfy one or more 
inequality constraints, or cannot contain the global minimum/^ given the 
inequality constraints are all satisfied. The set of criteria includes: 

(1) the/^6ar-criterion, wherein if fjbar is the smallest upper bound so 
far computed on / within the feasible region defined by the inequality constraints, 
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then any point x for whichy(x) >f_bar can be deleted. Similarly, any box X can 
be deleted if infiJQL)) >f_bar\ 

(2) the monotonicity criterion, wherein if g(x) is the gradient of f 
evaluated at a feasible point x for which all piii) < 0 (/- 1 , . . . then any such 
feasible point x for which g(x) ^ 0 can be deleted. Similarly, any feasible box X 
can be deleted if 0 $ g(X); 

(3) the convexity criterion, wherein if ///^(x) is the z-th diagonal 
element of the Hessian of /, then any feasible point x for all which all jc>,(x) < 0 
(/=!,.. and ///i(x) < 0 (for can be deleted. Similarly, any box X in 
the interior of the feasible region can be deleted if //«(X) < 0 (for /-I,. . and 

(4) the stationary point criterion, wherein points x are deleted using the 
interval Newton technique to solve the John conditions. (The John conditions are 
described in "Global Optimization Using hitervai Analysis" by Eldon R. Hansen, 
Marcel Dekker, Inc., 1992.) 

All of these criteria work best "in the small" when the objective function / 
is approximately linear or quadratic and "active" constraints are approximately 
linear. An active constraint is one that is zero at a solution point. For large 
intervals containing multiple stationary points the above criteria might not 
succeed in deleting much of a given box. In this case the box is split into two or 
more sub-boxes, which are then processed independently. By this mechanism all 
the inequality constrained global minima of a nonlinear objective function can be 
found. 

One problem is applying this procedure to large n-dimensional interval 
vectors (or boxes) that contain multiple local minima. In this case, the process of 
splitting in ^-dimensions can lead to exponential growth in the number of boxes 
to process. 
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It is well known that this problem (and even the problem of computing 
"sharp" bounds on the range of a function of ^-variables over an ^-dimensional 
box) is an "NP-hard" problem. In general, NP-hard problems require an 
exponentially increasing amount of work to solve as the number of mdependent 

variables, increases. 

Because NP-hardness is a worst-case property and because many practical 
engineering and scientific problems have relatively sunple structure, one problem 
is to use this simple structure of real problems to improve the efficiency of 
interval inequality constrained global optimization algorithms. 

Hence, what is needed is a method and an apparatus for using the structure 
of a nonlinear objective function to improve the efficiency of interval inequality 
constrained global optimization software. To this end, what is needed is a method 
and apparatus that efficiently deletes "large" boxes or parts of large boxes that the 
above criteria can only split. 

SUMMARY 

The present invention combines various methods to speed up the process 
of bounding all the inequality constrained global minima of a nonlinear function. 
The combined method uses the structure of the objective ftmction and inequality 
constraints to efficiently delete parts or all of large boxes that would otherwise 
have to be split. One embodiment of the present invention provides a system that 
solves a global inequality constrained optimization problem specified by a 
function / and a set of inequality constraints pi(x) < 0 (i=l, w), wherein/and pj 
are scalar functions of a vector x = {xj, x^, X3, ... During operation, the system 
receives a representation of the function / and the set of inequality constraints, and 
stores the representation in a memory within the computer system. Next, the 
system performs an interval inequality constrained global optimization process to 
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compute guaranteed bounds on a globally minimum value of the function_/(x) 
subject to the set of inequality constraints. During this process, the system applies 
term consistency to a set of relations associated with the global inequality 
constrained optimization problem over a subbox X, and excludes any portion of 
the subbox X that violates any set of these relations. The system also applies box 
consistency to the set of relations associated with the global inequality constrained 
optimization problem over the subbox X, and excludes any portion of the subbox 
X that violates any of the relations. The system also performs an interval Newton 
step for the global inequality constrained optimization problem over the subbox X 
to produce a resulting subbox Y, wherein the point of expansion of the interval 
Newton step is a point x. 

For any function of «-variablesy(x) there are different ways to analytically 
express a component Xj of the vector x. For example, one can write 
Xx) = g{Xj) - h{x), where g(xj) is a term in / for which it is possible to solve 
g(xj) = y for an element of xj, using g ^(y). For each of these rearrangements, if 
a given interval box X is used as an argument of h, then the new interval X/ for 
the y-th component of X, is guaranteed to be at least as narrow as the original, Xj , 

X; -XjH X) where X) =g\h(X)). 

This process is then be iterated using different terms g of the function / 
After reducing any element Xj of the box X to X/, the reduced value can be used 
in X thereafter to speed up the reduction process using other component functions 
if /is a component of the vector function f . 

Hereafter, the notation g{xj) for a term of the function / (x) implicitly 
represents any term of any component function. This eliminates the need for 
additional subscripts that do not add clarity to the exposition. 
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In one embodiment of the present invention, while applying term 
consistency, the system symbolically manipulates an equation to solve for a term, 
g(x }), thereby producing a modified equation g(x }) = ^(x), wherein the term 
g(x )) can be analytically inverted to produce an inverse function g'\y). Next, the 
system substitutes the subbox X into the modified equation to produce the 
equation g(X )) = h{X). The system then solves for X } = g~\h(X)); and intersects 
X ) with the j-th element of the subbox X to produce a new subbox X^. This new 
subbox X^ contains all solutions of the equation within the subbox X. 
Furthermore the size of the new subbox X^ is less than or equal to the size of the 
subbox X. 

In a variation on this embodiment, applying term consistency to the set of 
relations involves applying term consistency to the set of inequality constraints 
Pj{x) < 0 over the subbox X. 

In a variation on this embodiment, applying box consistency to the set of 
relations involves applying box consistency to the set of inequality constraints 
Pt(x) < 0 (i==l, over the subbox X. 

In a variation on this embodiment, during the interval inequality 
constrained global optimization process, the system keeps track of a least upper 
ho\mdf_bar of the functiony(x) at a feasible point x, and removes from 
consideration any subbox X for which^X) >f_bar. In this variation, the system 
applies term consistency to the fjbar inequality ^x) <f_bar over the subbox X. 

In a further variation, the system applies box consistency to HiQfJbar 
inequality Xx) < f_bcir over the subbox X. 

In a variation on this embodiment, if the subbox X is strictly feasible 
(PiiX) < 0 for all /=7, ...,«), the system determines a gradient g(x) of the function 
Xx), wherein g(x) includes components ^^(x) (i=l, ...,n). Next, the system 
removes from consideration any subbox for which g(x) is bounded away from 
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zero, indicating that the subbox does not include an extremum ofJ[x), In this 
variation, the system applies term consistency to each component 
g/(x)=0 (/=1, ...,n)of g(x)=0 over the subbox X. 

In a further variation, the system applies box consistency to each 
component g,(x)=0 (/=1,. . .,«) of g(x)=0 over the subbox X. 

In a variation on this embodiment, if the subbox X is strictly feasible 
(p/(X) < 0 for all ...,«), the system determines diagonal elements 
Hii(x) (i=l,„,,n) of the Hessian of the functionXx). Next, the system removes 
from consideration any subbox X for which a diagonal element Hij{X) of the 
Hessian over the subbox X is always negative, indicating that the function /is not 
convex over the subbox X and consequently does not contain a global minimum 
within the subbox X. In this variation, the system applies term consistency to 
each inequality if„(x) > 0 (/=1, ...,^) over the subbox X. 

In a further variation, the system applies box consistency to each 
inequality Hii(x) > 0 (/=!, over the subbox X. 

In a variation on this embodiment, if the subbox X is strictly feasible 
(Pz(X) < 0 for all i=ly ...,«), while performing the interval Newton step the system 
computes the Jacobian J(x,X) of the gradient of the function / evaluated as a 
function of a point x over the subbox X. Next, the system computes an 
approximate inverse B of the center of J(x,X), and uses the approximate inverse B 
to analytically determine the system Bg(x), wherein g(x) is the gradient of the 
functionXx), and wherein g(x) includes components g,(x) (/=1, 

In a further variation, the system applies term consistency to each 
component (Bg(x)), = 0 (/=1, to solve for the variable over the subbox X. 

In a further variation, the system applies box consistency to each 
component (Bg(x)), = 0 (i=l,...,n)to solve for the variable Xj over the subbox X. 
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In a variation on this embodiment, the system performs the interval 
Newton step on the John conditions. 

In a variation on this embodiment, while performing the interval inequality 
constrained global optimization process, the system linearizes the set of inequality 
constraints to produce a set of linear inequality constraints with interval 
coefficients that enclose the nonlinear inequality constraints. Next, the system 
preconditions the set of linear inequality constraints through additive linear 
combinations to produce a set of preconditioned linear inequality constraints. In 
this variation, the system applies term consistency to the set of preconditioned 
linearized inequality constraints over the subbox X. 

In a further variation, the system applies box consistency to the set of 
preconditioned linearized inequality constraints over the subbox X. 

BRIEF DESCRIPTION OF THE FIGURES 

FIG. 1 illustrates a computer system in accordance with an embodiment of 
the present invention. 

FIG. 2 illustrates the process of compiling and using code for interval 
computations in accordance with an embodiment of the present invention. 

FIG. 3 illustrates an arithmetic imit for interval computations in 
accordance with an embodiment of the present invention. 

FIG. 4 is a flow chart illustrating the process of performing an interval 
computation in accordance with an embodiment of the present invention. 

FIG. 5 illustrates four different interval operations in accordance with an 
embodiment of the present invention. 

FIG. 6 is a flow chart illustrating the process of finding an interval solution 
to a nonlinear equation in accordance with an embodiment of the present 
invention. 
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FIG. 7A presents a portion of a flow chart illustrating the process of using 
term consistency to solve an interval global optimization problem with inequality 
constraints in accordance with an embodiment of the present invention. 

FIG. 7B presents a portion of a flow chart illustrating the process of using 
term consistency to solve an interval global optimization problem with inequality 
constraints in accordance with an embodiment of the present invention. 

FIG. 7C presents a portion of a flow chart illustrating the process of using 
term consistency to solve an interval global optimization problem with inequality 
constraints in accordance with an embodiment of the present invention. 

FIG. 7D presents a portion of a flow chart illustrating the process of using 
term consistency to solve an interval global optimization problem with inequality 
constraints in accordance with an embodiment of the present invention. 

DETAILED DESCRIPTION 

The following description is presented to enable any person skilled in the 
art to make and use the invention, and is provided in the context of a particular 
application and its requirements. Various modifications to the disclosed 
embodiments will be readily apparent to those skilled in the art, and the general 
principles defined herein may be applied to other embodiments and applications 
without departing from the spirit and scope of the present invention. Thus, the 
present invention is not limited to the embodiments shown, but is to be accorded 
the widest scope consistent with the principles and features disclosed herein. 

The data structures and code described in this detailed description are 
typically stored on a computer readable storage medium, which may be any device 
or medium that can store code and/or data for use by a computer system. This 
includes, but is not limited to, magnetic and optical storage devices such as disk 
drives, magnetic tape, CDs (compact discs) and DVDs (digital versatile discs or 
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digital video discs), and computer instruction signals embodied in a transmission 
medium (with or without a carrier wave upon which the signals are modulated). 
For example^ the transmission medium may include a communications network, 
such as the Internet. 

5 

Compttter System 

FIG. 1 illustrates a computer system 1 00 in accordance with an 
embodiment of the present invention. As illustrated in FIG. 1, computer system 
100 includes processor 102, which is coupled to a memory 112 and a to peripheral 

10 bus 110 through bridge 106. Bridge 106 can generally include any type of 
circuitiy for coupling components of computer system 100 together. 

Processor 1 02 can include any type of processor, including, but not limited 
to, a microprocessor, a mainframe computer, a digital signal processor, a personal 
organizer, a device controller and a computational engine within an appliance. 

15 Processor 102 includes an arithmetic unit 104, which is capable of performing 
computational operations using floating-point numbers. 

Processor 1 02 communicates with storage device 1 08 through bridge 1 06 
and peripheral bus 110. Storage device 108 can include any type of non-volatile 
storage device that can be coupled to a computer system. This includes, but is not 

20 limited to, magnetic, optical, aud magneto-optical storage devices, as well as 
storage devices based on flash memory and/or battery-backed up memory. 

Processor 102 communicates with memory 112 through bridge 106. 
Memory 1 12 can include any type of memory that can store code and data for 
execution by processor 102. As illustrated in FIG. 1, memory 1 12 contains 

25 computational code for intervals 114. Computational code 114 contains 

instructions for the interval operations to be performed on individual operands, or 
interval values 115, which are also stored within memory 112. This 
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computational code 1 1 4 and these interval values 1 1 5 are described in more detail 
below with reference to FIGs. 2-5. 

Note that although the present invention is described in the context of computer 
system 100 illustrated in FIG. 1, the present invention can generally operate on 
any type of computing device that can perform computations involving floating- 
point numbers. Hence, the present invention is not limited to the computer system 
100 illustrated in FIG. 1. 

Compiling and Using Interval Code 

FIG. 2 illustrates the process of compiling and using code for interval 
computations in accordance with an embodiment of the present invention. The 
system starts with source code 202, which specifies a number of computational 
operations involving intervals. Source code 202 passes through compiler 204, 
which converts source code 202 into executable code form 206 for interval 
computations. Processor 102 retrieves executable code 206 and uses it to control 
the operation of arithmetic unit 104, 

Processor 102 also retrieves interval values 115 from memory 1 12 and 
passes these interval values 1 1 5 through arithmetic unit 1 04 to produce results 
212. Results 212 can also include interval values. 

Note that the term "compilation" as used in this specification is to be 
construed broadly to include pre-compilation and just-in-time compilation, as well 
as use of an interpreter that interprets instructions at run-time. Hence, the term 
"compiler" as used in the specification and the claims refers to pre-compilers, 
just-in-time compilers and interpreters. 
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Arithmetic Unit for Intervals 

FIG. 3 illustrates arithmetic unit 1 04 for interval computations in more 
detail accordance with an embodiment of the present invention. Details regarding 
the construction of such an arithmetic unit are well known in the art. For 
example, see U.S. Patent Nos. 5,687,106 and 6,044,454. Arithmetic unit 104 
receives intervals 302 and 312 as inputs and produces interval 322 as an output. 

In the embodiment illustrated in FIG. 3, interval 302 includes a first 
floating-point number 304 representing a first endpoint of interval 302, and a 
second floating-point number 306 representing a second endpoint of interval 302. 
Similarly, interval 312 includes a first floating-point number 314 representing a 
first endpoint of interval 312, and a second floating-point number 316 
representing a second endpoint of interval 312. Also, the resulting interval 322 
includes a first floating-point number 324 representing a first endpoint of interval 
322, and a second floating-point number 326 representing a second endpoint of 
interval 322. 

Note that arithmetic unit 1 04 includes circuitry for performing the interval 
operations that are outlined in FIG. 5. This circuitry enables the interval 
operations to be performed efficiently. 

However, note that the present invention can also be applied to computing 
devices that do not include special-purpose hardware for performing interval 
operations. In such computing devices, compiler 204 converts interval operations 
into a executable code that can be executed using standard computational 
hardware that is not specially designed for interval operations. 

FIG. 4 is a flow chart illustrating the process of performing an interval 
computation in accordance with an embodiment of the present invention. The 
system starts by receiving a representation of an interval, such as first floating- 
point number 304 and second floating-point number 306 (step 402). Next, the 
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system performs an arithmetic operation using the representation of the interval to 
produce a result (step 404). The possibilities for this arithmetic operation are 
described in more detail below with reference to FIG. 5. 

Interval Operations 

FIG. 5 illustrates four different interval operations in accordance with an 
embodiment of the present invention. These interval operations operate on the 
intervals X and 7. The interval includes two endpoints, 

X denotes the lower bound of X, and 
X denotes the upper bound of X. 

The interval X is a closed subset of the extended (including -oo and +oo) real 
numbers i?* (see line 1 of FIG. 5). Similarly the interval 7 also has two endpoints 
and is a closed subset of the extended real numbers i?* (see line 2 of FIG, 5). 

Note that an interval is a point or degenerate interval ifX- [x, x]. Also 
note that the left endpoint of an interior interval is always less th^ or equal to the 
right endpoint. The set of extended real numbers, i? * is the set of real numbers, R, 
extended with the two ideal points negative infinity and positive infinity: 

i?* = i?U {-co} U {+oo}. 

In the equations that appear in FIG. 5, the up arrows and down arrows 
indicate the direction of rounding in the next and subsequent operations. Directed 
rounding (up or down) is applied if the result of a floating-point operation is not 
machine-representable. 

The addition operation X+ 7 adds the left endpoint of JTto the left 
endpoint of Y and rounds down to the nearest floating-point number to produce a 
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resulting left endpoint, and adds the right endpoint of X to the right endpoint of Y 
and rounds up to the nearest floating-point number to produce a resulting right 
endpoint. 

Similarly, the subtraction operation X- 7 subtracts the right endpoint of Y 
from the left endpoint of X and rounds down to produce a resulting left endpoint, 
and subtracts the left endpoint of Y from the right endpoint of X and rounds up to 
produce a resulting right endpoint. 

The multiplication operation selects the minimum value of four different 
terms (rounded down) to produce the resulting left endpoint. These terms are: the 
left endpoint of multiplied by the left endpoint of Y; the left endpoint ofX 
multiplied by the right endpoint of Y; the right endpoint of multiplied by the left 
endpoint of 7; and the right endpoint of X multiplied by the right endpoint of Y. 
This multiplication operation additionally selects the maximum of the same four 
terms (rounded up) to produce the resulting right endpoint. 

Similarly, the division operation selects the minimum of four different 
terms (rounded down) to produce the resulting left endpoint. These terms are: the 
left endpoint of X divided by the left endpoint of 7; the left endpoint of X divided 
by the right endpoint of 7; the right endpoint of X divided by the left endpoint of 
7; and the right endpoint of X divided by the right endpoint of 7 This division 
operation additionally selects the maximum of the same four terms (rounded up) 
to produce the resulting right endpoint. For the special case where the interval 7 
includes zero, X/7is an exterior interval that is nevertheless contained in the 
interval R *. 

Note that the result of any of these interval operations is the empty interval 
if either of the intervals, X or 7, are the empty interval. Also note, that in one 
embodiment of the present invention, extended interval operations never cause 
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undefined outcomes, which are referred to as "exceptions" in the IEEE 754 
standard. 

Term Consistency 

FIG. 6 is a flow chart illustrating the process of solving a nonlinear 
equation through interval arithmetic and term consistency in accordance with an 
embodiment of the present invention. The system starts by receiving a 
representation of a nonlinear equation/x) = 0 (step 602), as well as a 
representation of an initial box with X WithXj an element of X (step 604). Next, 
the system symboUcally manipulates the equation^x) = 0 into a form g(xj)-h{x)= 
0, wherein the term g{xj) can be analytically inverted to produce an inverse 
function g'^iy) (step 606). 

Next, the system substitutes the initial box X into h{X) to produce the 
equation g(X}) = h(X) (step 608). The system then solves forX^ = g'\h(X)) 
(step 610). The resulting interval is then intersected with the initial interval Xj 
to produce a new interval X/ (step 612). 

At this point, if X/ is empty, the system can terminate. Otherwise, the 
system can perform further processing. This further processing involves saving Xj 
by setting =Xjmd also, setting =Xj^ (step 614). Next, ifw(X^^^) is 
sufficiently reduced at step 616, the system retums to step 606 for another 
iteration of term consistency on another term g ofJ{x). Otherwise, the system 
terminates. 

Examples of Applying Term Consistency 

For example, suppose X^) -x^ -x + 6. We can define g(x) = x^ and 
h(x)=x-6. Let Z= [-10,10]. Theproceduralstepis(jry = X-(5= [-16,4]. Since 
(X') must be non-negative, we replace this interval by [0,4], Solving forX', we 
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obtain = ± [0,2]. In replacing the range of h{x) (i.e., [-16,4]) by non-negative 
values, we have excluded that part of the range h(x) that is not in the domain of 

g(x) = x^. 

Suppose that we reverse the roles of g and h and use the iterative step 
5 hiX') = g{X), That isX'^6= X\ We obtain = [6, 1 06] . hitersecting this 
result with the interval [-10,10], of interest, we obtain [6,10]. This interval 
excludes the set of values for which the range of g{X) is not in the intersection of 
the domain of h{X) with X. 

Combining these results, we conclude that any solution of 
10 fiX) = giX) ~ hiX) = 0 that occurs mX= [-10,10] must be in both [-2,2] and 
[6,10]. Since these intervals are disjoint, there can be no solution in [-10,10]. 

In practice, if we already reduced the interval from [-10,10] to [-2,2] by 
solving for g, we use the narrower interval as input when solving for h. 

This example illustrates the fact that it can be advantageous to solve a 
15 given equation for more than one of its terms. The order in which terms are 

chosen affects the efficiency. Unfortunately, it is not known how best to choose 
the best order. 

Also note that there can be many choices for g(x). For example, suppose 
we use term consistency to narrow the interval bound JSf on a solution of 

20 /(x) = ca^ + 6x + c = 0. We can let g{x) = bx and compute X' = -{aX^ + c)/b or 
we can let g{x) = ax^ and compute ' = ± [-(6Z+c)/a]^^^ We can also separate x^ 
into x^ * x^ and solve for one of the factors X' = ± [-(bX+c)/(aX^)y^. 

In the multidimensional case, we may solve for a term involving more than 
one variable. We then have a two-stage process. For example, suppose we solve 

25 for the term l/(x+y) from the functionX^,^^) = l/{x-^y) - h(x,y) = 0. Let 

xeX^ [1,2] and J/ 6 7= [0.5,2], Suppose we find that /^(Jt; 7) = [0.5,1]. Then 
l/(x+y) E [0.5,1] so x-^y e [1,2]. Now we replace j; by Y= [0.5,2] and obtain the 
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bound [-1,L5] onX Intersecting this interval with the given boundX== [1,2] on 
X, we obtain the new bound Jf' = [1,1.5], 

We can use X' to get a new bound on h; but this may require extensive 
computing if /z is a complicated function; so suppose we do not. Suppose that we 
do, however, use this bound on our intermediate result x + = [1,2]. Solving for 
as [1,2] -X\ we obtain the bound [-0.5,1]. Intersecting this interval with 7, we 
obtain the new bound Y' = [0.5,1] on>'. Thus, we improve the bounds on both x 
mdy by solving for a single term of / 

The point of these examples is to show that term consistency can be used 
in many ways both alone and in combination with the interval Nev^on algorithm 
to improve the efficiency with which roots of a single nonlinear equation can be 
computed. The same is true for systems of nonlinear equations. 

Inequality Constrained Interval Global Optimization 

FIGs. 7A-7D collectively present a flow chart illustrating the process of 
solving an interval global optimization problem with inequality constraints in 
accordance with an embodiment of the present invention. Generally, we seek a 
solution in a single box specified by the user. However, any number of boxes can 
be initially specified. 

The boxes can be disjoint or overlap. However, if they overlap, a 
minimum at a point that is common to more than one box is separately found as a 
solution in each box containing it. In this case, computing effort is wasted. If the 
user does not specify an initial box or boxes, we use a default box. The process 
finds the global minimum in the set of points formed by the set of boxes. We 
assume these initial boxes are placed in a list Lj of boxes to be processed. 

Suppose the user of the process knows a point xjbar that is guaranteed to 
be feasible. If so, we use this point to compute an initial upper hoxmdfjbar on the 
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global minimum/^. If x_bar cannot be represented exactly on the computer, the 
system forms a representable interval vector X containing xjbar. We evaluate 
/(X) and obtain [lower boundy(X),upper bound^X)]. Even if rounding and/or 
dependence are such that X cannot be numerically proven to be certainly feasible, 
we rely upon the user and assume that X contains a feasible point. Therefore, we 
SQtfjbar equal to the upper boimd of/(X). 

Also the user may know an upper hound fjbar on/* even though he may 
not know where (or even if) / takes on such a value in the feasible region defined 
by the inequality constraints. If so, we set fjyar equal to this known bound. If the 
known bound is not representable on the computer, the system rounds the value 
up to a larger value that is representable. 

If no feasible point is known and no upper bound on/* is known, we set 
fJbar = +00. The user must specify a box size tolerance ex and a function width 
tolerance Sf, 

In the system, nonlinear fttnctions are often linearized over a box X using 
Taylor expansion. However, use of linearization is generally ineffective if X is 
wide. Four different sub-procedures in the system use linearization. The system 
uses a "linearization test" to decide if a given sub-procedure should be used for a 
given box. Each of the four sub-procedures uses a separate test of the same kind. 
In each case, a criterion for "success" is defined. The symbol wr denotes the 
width of the largest box for which success was achieved. The symbol wj denotes 
the width of the smallest box for which success was not achieved. A given sub- 
procedure is applied for a box X whenever w(X) < (wr + wj)/2. For each sub- 
procedure, the system initially sets wr = 0 and wj = w{X^^\ where X^^^ is the 
initial box. In addition, the system specifies a hound fJbar if one is known. Note 
that the four sub-procedures referred to above are: (1) Newton applied to the 
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gradient; (2) Newton applied to the John conditions; (3) Imearization of the 
constraints; and (4) linearization of f(x) < fj)ar. 

The steps of the process are performed in the order given except as 
indicated by branching. 

First, for each box in the list Zy, the system applies term consistency to 
each of the inequality constraints pi{x) < 0 (/-7, ..,,m) (step 701). 

lffJbar<+oo, then for each box in Lj, the system applies term consistency 
to the inequality f<f_bar (step 702). 

If Z; is empty, the system goes to step 742. Otherwise, the system selects 
(for the next box X to be processed) the box in 1/ for which the lower bound of 
XX) is smallest. For later reference, the system denotes this box by X^^\ The 
system also deletes X from Lj (step 703). 

The system applies term consistency over X to each constraint inequality. 
If X is deleted, the system goes to step 703. The system skips this step if X has 
not changed since step 701 . (step 704). 

Next, the system computes an approximation x for the center m(X) of X. 
If the upper bound oifiX)>f_bar, the system goes to step 708 (step 705). 

For future reference, the system denotes the box X by X^^l Next, the 
system does a constrained line search to try to reduceX.6ar (step 706). 

lff_bar was not reduced in step 706, the system goes to step 709 
(step 707). 

Next, the system applies term consistency to the inequality Xx) ^fjyar. If 
X is deleted, the system goes to step 703 (step 708). 

If w(yi)<ex and w[/(X)]< ep, the system puts X in list L2. Otherwise, if X 
is sufficiently reduced relative to the box X^^^ defined in step 703, the system puts 
X in 1/ and goes to step 703 (step 709). We say that a box X is sufficiently 
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reduced if any component of X is reduced by an amount that is at least a fraction 
(say 0.25) of the width of the widest component of X. 

Next, the system applies box consistency to each inequality constraint. If 
f_bar <+oo^ the system also applies box consistency to the inequality /x)^Aar. 
If X is deleted, the system goes to step 703 (step 710). 

If the upper bound of p^Qi)>0 for any i=l (i.e., if X is not certainly 
strictly feasible), the system goes to step 726 (step 711). 

Next, the system applies term consistency to gi(x)=0 for where g 

is the gradient of the objective function / If the result for any /=7,...,w is empty, 
the system goes to step 703 (step 712). Note that the steps 712 through 725 do not 
use inequality constraints because none are active for the current box X. 

Otherwise, the system applies term consistency to the relation //„(x)>0 for 
i=l,.,.,n, where H„ is a diagonal element of the Hessian of / If the result is empty, 
the system goes to step 703 (step 713). 

Next, the system repeats step 709 (step 714). 

The system then applies box consistency to g^^O for If the result 

is empty, the system goes to step 703 (step 715). 

Next, the system applies box consistency to Hu(x)>0 for i^l,„.,n. If the 
result is empty, the system goes to step 703 (step 716). 

Next, the system repeats step 709 (step 717). 

The system then uses a criterion w(X) > {w}+wr}I2 to decide if a Newton 
step should be applied to solve g=0. If not, the system goes to step 726 
(step 718). Note that, wi denotes the width of the smallest box for which 

I T 

M = BJ(x,X) is irregular. If M is regular for a given box, wr denotes the width 
of the largest box for which M has been shown to be regular. 

The system generates the interval Jacobian J(x,X) of the gradient g and 
computes the approximate inverse B of the center of J(x,X). The system also 
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applies one step of an interval Newton method to solve g=0. If the result is 
empty, the system goes to step 703 (step 719). 

Next, the system repeats step 709 (step 720). 

The system then uses the matrix B found m step 719 to obtain Bg in 
analytic form. The system applies term consistency to solve the f-th equation of 
Bg-0 for the z-th variable x, for /=7,...,^. If the result is empty, the system goes to 
step 703 (step 721). 

Next, the system repeats step 709 (step 722). 

The system uses box consistency to solve the z-th equation of Bg (as 
obtained in step 721) for the z-th variable for /=7,...,w. If the result is empty, the 
system goes to step 703 (step 723). 

Next, the system repeats step 709 (step 724). 

The system uses the matrix B found in step 719 in a Newton Step to try to 
reduce the upper hound f_bar (step 725). A line search can be performed as 
follows. Suppose we evaluate the gradient g(x) of/x) at a point x. Note that / 
decreases (locally) in the negative gradient direction from x. A simple procedure 
for finding a point where /is small is to search along this half-line. Let x be the 
center of the current box. Define the half-line of points y(a)=x-ag(x) where a^O. 
We now use a standard procedure for finding an approximate minimum of the 
objective fimction / on this half-line. We first restrict our region of search by 
determining the value a' such that y(a')^x-a'g is on the boundary of the current 
box X, and search between x and x \ We use the following procedure. Each 
application of the procedure requires an evaluation of f. Procedure: If/x ')<fix), 
replace x by (x+x y2 Otherwise, we replace x' by (x+x')/2. 

Next, the system computes an approximation x for the center m(X) of X. 
If J{x)>f_bar, the system goes to step 703 (step 726). 
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The system skips this step and goes to step 732 if X = \ the same box 
for which a line search was done in step 706. Otherwise, the system does a line 
search to try to reduce fjbar. If fjbar is not reduced, the system goes to step 732 
(step 727). 

For future reference, the system denotes X by X^^l The system then uses a 
linearization test to decide whether to linearize and "solve" the inequality 
J(x)<fjbar. If this condition is not satisfied, the system goes to step 732 
(step 728). 

The system uses a Hnear method to try to reduce X using the inequality 
A^WJ>^^' IfX is deleted, the system goes to step 703. Otherwise, if this 
application of the linear method does not sufficiently reduce the box X^^\ the 
system goes to step 73 1 (step 729). 

The system uses a quadratic method to try to reduce X using the inequality 
J{x)<f_bar, IfX is deleted, the system goes to step 703 (step 730). 

Next, the system repeats step 709 (step 731). 

The system uses a criterion similar to that in step 7 1 8 to decide whether to 
linearize and "solve" the inequality constraints. If the procedure indicates that the 
linearization should not be done, the system goes to 739 (step 732). 

Next, the system selects the inequality constraints to be solved in 
linearized form, and possibly adds to this set the inequality Xx)4/16ar. Note that 
the selection process removes from consideration any inequality constraints that 
are not sufficiently violated. If no inequalities are selected, the system goes to 
step 739. Otherwise, the system linearizes the resulting set of inequalities, and 
solves the resulting set of linear inequalities. If the solution set is empty, the 
system goes to step 703 (step 733). 

Next, the system repeats step 709 (step 734). 
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The system then uses the preconditioning matrix B formed at step 733 to 
analytically precondition the set of inequalities that were selected for use in step 
733. The system also uses term consistency to solve each of the preconditioned 
inequalities. In so doing, each inequality is solved for the same (single) variable 
for which the linearized inequality was solved in step 733 (step 735). 

Next, the system repeats step 709 (step 736). 

The system uses box consistency to solve the same inequalities for the 
same variables as in step 735 (step 737). 

Next, the system repeats step 709 (step 738). 

The system uses a linearization test to decide whether to solve the John 
conditions. If not, the system goes to step 742 (step 739). 

The system modifies the John conditions by omitting those constraints pt 
for which (X)<0 (since they are not active in X). The system applies one pass 
of the interval Newton method to the (modified) John conditions. If the result is 
empty, the system goes to step 703 (step 740). 

Next, the system repeats step 709 (step 741). 

In various previous steps, gaps may have been generated in components of 
X. If so, the system merges any of these gaps that overlap. The system then splits 
X, and places the resulting subboxes in Z/ and goes to step 703 (step 742). 

If f_bar<+oo, the system applies term consistency to J(x)<f_bar for each 
box in the list L^. The system denotes those that remain by X^^\,.., X^^^ where s is 
the number of boxes remaining. The system also determines 

l = mmf(^'^) ^ = max7(^'^) (step 743). 

\^^s \<i<s 

Finally, the system terminates (step 744). 
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After termination, w{X)<Sx and w(J{X))< Sf for each remaining box X in 
the list ^2. Also, 

F<f{x)<F 

for every point x in all remaining boxes. If, after termination, y^iar < we 
know there is a feasible point in the initial box(es). Therefore, we know that 

F</*<min{7,F}. 

If, after termination, /_6ar = then we have not found a certainly 
feasible point. There may or may not be one in X^^l However, we know that if a 
feasible point x does exist in X^^\ then 

F</(x)<F. 

Suppose a feasible point exists. If our algorithm fails to find a certainly 
feasible point, then it does not produce an upper ho^m^ifJ)ar and cannot use the 
relation / <fjyar. In particular, it cannot delete local minima whereXx)>/*. In 
this case, all local minima are contained in the set of output boxes. 

If all of the initial box X^^^ is deleted by our process, then we have proved 
that every point in X^^^ is infeasible. Suppose that every point in X^^^ is infeasible. 
Our process may prove this to be the case. However, we delete a subbox ofX<^> 
only if it is certainly infeasible. Rounding errors and/or dependence may prevent 
us from proving certain infeasibility of an infeasible subbox. Increased 
wordlength can reduce rounding errors and decreasing Sx can reduce the effect of 
dependence by causing subboxes to eventually become smaller. However, neither 
effect can completely be removed. 
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Suppose fjbar = after termination and X^^^ has not been entirely 
eliminated. It may still be possible either to compute/_Z?^r < <» or to delete all of 
X^^^ by reducing the values of ex and Sf and continuing to apply the process. To 
try to do so, we need only to reduce these tolerances and move the boxes from list 
5 Z2 to list L7. We can then restart the algorithm from the beginning with or without 
use of increased precision. 

Note that steps 712 through 725 are essentially the same as corresponding 
steps in the process for unconstrained optimization. This is because these steps are 
applied to a box that is certainly feasible. 
10 In our process, we avoid using more complicated procedures until the 

simpler ones no longer make sufficient progress in reducing the current box. For 
example, we delay use of the John conditions until all other procedures become 
unproductive. 

We avoid using procedures that use Taylor expansions until we have 
15 evidence that expanded forms provide sufficiently accurate approximations to 
functions. 

Inequality constraints are often simple relations of the foxmxi^bi oxXi^Ui 
Such constraints serve to determine the initial box X^^l Therefore, they are 
satisfied throughout X^^l Such constraints are omitted when applying any 
20 procedure designed to eliminate infeasible points. See steps 701, 704, 710 and 
733. 

In step 706 we use a line search to try to xcdnoQ fjbar. This involves 
evaluating the gradient of f. We can avoid this evaluation by simply checking if 
the midpoint x of the box is feasible and, if so, using /x) as a candidate value for 
25 fjbar. However, it helps to have a finite value offjbar early, so the line search is 
worth doing whenX.6ar=H-«>. Step 727 also uses a line search. It is less important 
here because a finite value offjbar is likely to be computed in step 706. If there 
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are a large number of constraints, then evaluating the gradient is not a dominant 
part of the work to do the line search. 

Experience has shown that efficiency is enhanced if the subbox X to be 
processed is chosen to be the one for which infif^S)) is smallest among all 
candidate subboxes. This tends to cause a smaller value of fjbar to be computed 
early in the algorithm. Therefore, we return to step 703 to choose a new subbox 
whenever the current box has substantially changed. 

Suppose we find that pi(X)<0 for some value of i and some box X. Then 
Pj{X ')<0 for any X tX. Therefore, we record the fact that p,(X) <0 so that we 
need not evaluate p,(X ') for any X t X. 

It is possible that the procedures in step 719, 721, 723 or 740 prove the 
existence of a solution to the optimization problem. If so, the user can be 
informed of this fact. Such a solution may be local or global. 

The foregoing descriptions of embodiments of the present invention have 
been presented only for purposes of illustration and description. They are not 
intended to be exhaustive or to limit the present invention to the forms disclosed. 
Accordingly, many modifications and variations will be apparent to practitioners 
skilled in the art. For example, although the present invention describes the use of 
derivatives in certain situations, it is often possible to use slopes instead of 
derivatives. 

Additionally, the above disclosure is not intended to limit the present 
invention. The scope of the present invention is defined by the appended claims. 
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